Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 9579 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.6 MiB |
| Average record size in memory | 281.5 B |
Variable types
| Categorical | 1 |
|---|---|
| Text | 3 |
| Numeric | 7 |
Assault is highly overall correlated with Burglary and 5 other fields | High correlation |
Burglary is highly overall correlated with Assault and 5 other fields | High correlation |
Murder is highly overall correlated with Assault and 4 other fields | High correlation |
Rape is highly overall correlated with Assault and 4 other fields | High correlation |
Robbery is highly overall correlated with Assault and 5 other fields | High correlation |
Theft is highly overall correlated with Assault and 5 other fields | High correlation |
Vehicle_Theft is highly overall correlated with Assault and 5 other fields | High correlation |
Murder is highly skewed (γ1 = 31.39612644) | Skewed |
Rape is highly skewed (γ1 = 23.8876975) | Skewed |
Robbery is highly skewed (γ1 = 31.40834843) | Skewed |
Assault is highly skewed (γ1 = 34.01461082) | Skewed |
Theft is highly skewed (γ1 = 22.74568192) | Skewed |
Vehicle_Theft is highly skewed (γ1 = 20.43984571) | Skewed |
Murder has 7655 (79.9%) zeros | Zeros |
Rape has 4306 (45.0%) zeros | Zeros |
Robbery has 4027 (42.0%) zeros | Zeros |
Assault has 1554 (16.2%) zeros | Zeros |
Burglary has 686 (7.2%) zeros | Zeros |
Theft has 345 (3.6%) zeros | Zeros |
Vehicle_Theft has 1931 (20.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-19 21:23:04.072026 |
|---|---|
| Analysis finished | 2024-09-19 21:23:05.959195 |
| Duration | 1.89 second |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
Region
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 518.9 KiB |
| South | |
|---|---|
| Midwest | |
| Northeast | |
| West |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.457459 |
| Min length | 4 |
Characters and Unicode
| Total characters | 61856 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | South |
|---|---|
| 2nd row | South |
| 3rd row | South |
| 4th row | South |
| 5th row | South |
Common Values
| Value | Count | Frequency (%) |
| South | 3038 | |
| Midwest | 2829 | |
| Northeast | 2403 | |
| West | 1309 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| south | 3038 | |
| midwest | 2829 | |
| northeast | 2403 | |
| west | 1309 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 11982 | |
| e | 6541 | |
| s | 6541 | |
| o | 5441 | |
| h | 5441 | |
| S | 3038 | 4.9% |
| u | 3038 | 4.9% |
| M | 2829 | 4.6% |
| i | 2829 | 4.6% |
| d | 2829 | 4.6% |
| Other values (5) | 11347 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 61856 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 11982 | |
| e | 6541 | |
| s | 6541 | |
| o | 5441 | |
| h | 5441 | |
| S | 3038 | 4.9% |
| u | 3038 | 4.9% |
| M | 2829 | 4.6% |
| i | 2829 | 4.6% |
| d | 2829 | 4.6% |
| Other values (5) | 11347 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 61856 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 11982 | |
| e | 6541 | |
| s | 6541 | |
| o | 5441 | |
| h | 5441 | |
| S | 3038 | 4.9% |
| u | 3038 | 4.9% |
| M | 2829 | 4.6% |
| i | 2829 | 4.6% |
| d | 2829 | 4.6% |
| Other values (5) | 11347 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 61856 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 11982 | |
| e | 6541 | |
| s | 6541 | |
| o | 5441 | |
| h | 5441 | |
| S | 3038 | 4.9% |
| u | 3038 | 4.9% |
| M | 2829 | 4.6% |
| i | 2829 | 4.6% |
| d | 2829 | 4.6% |
| Other values (5) | 11347 |
State
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 539.0 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 13 |
| Mean length | 8.6057 |
| Min length | 4 |
Characters and Unicode
| Total characters | 82434 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ALABAMA |
|---|---|
| 2nd row | ALABAMA |
| 3rd row | ALABAMA |
| 4th row | ALABAMA |
| 5th row | ALABAMA |
| Value | Count | Frequency (%) |
| new | 1089 | 9.7% |
| pennsylvania | 827 | 7.4% |
| texas | 625 | 5.6% |
| illinois | 532 | 4.7% |
| jersey | 486 | 4.3% |
| california | 461 | 4.1% |
| missouri | 418 | 3.7% |
| michigan | 416 | 3.7% |
| york | 387 | 3.5% |
| ohio | 368 | 3.3% |
| Other values (45) | 5599 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10134 | |
| N | 9428 | |
| I | 9289 | |
| S | 7449 | 9.0% |
| O | 6432 | 7.8% |
| E | 6351 | 7.7% |
| R | 3973 | 4.8% |
| L | 3812 | 4.6% |
| T | 3042 | 3.7% |
| C | 2510 | 3.0% |
| Other values (16) | 20014 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 82434 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 10134 | |
| N | 9428 | |
| I | 9289 | |
| S | 7449 | 9.0% |
| O | 6432 | 7.8% |
| E | 6351 | 7.7% |
| R | 3973 | 4.8% |
| L | 3812 | 4.6% |
| T | 3042 | 3.7% |
| C | 2510 | 3.0% |
| Other values (16) | 20014 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 82434 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 10134 | |
| N | 9428 | |
| I | 9289 | |
| S | 7449 | 9.0% |
| O | 6432 | 7.8% |
| E | 6351 | 7.7% |
| R | 3973 | 4.8% |
| L | 3812 | 4.6% |
| T | 3042 | 3.7% |
| C | 2510 | 3.0% |
| Other values (16) | 20014 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 82434 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 10134 | |
| N | 9428 | |
| I | 9289 | |
| S | 7449 | 9.0% |
| O | 6432 | 7.8% |
| E | 6351 | 7.7% |
| R | 3973 | 4.8% |
| L | 3812 | 4.6% |
| T | 3042 | 3.7% |
| C | 2510 | 3.0% |
| Other values (16) | 20014 |
City
Text
| Distinct | 7445 |
|---|---|
| Distinct (%) | 77.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 553.2 KiB |
Length
| Max length | 44 |
|---|---|
| Median length | 38 |
| Mean length | 10.120159 |
| Min length | 3 |
Characters and Unicode
| Total characters | 96941 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6366 ? |
|---|---|
| Unique (%) | 66.5% |
Sample
| 1st row | Abbeville |
|---|---|
| 2nd row | Adamsville |
| 3rd row | Addison |
| 4th row | Alabaster |
| 5th row | Albertville |
| Value | Count | Frequency (%) |
| township | 669 | 4.9% |
| village | 250 | 1.8% |
| city | 231 | 1.7% |
| county | 140 | 1.0% |
| town | 135 | 1.0% |
| lake | 131 | 1.0% |
| park | 121 | 0.9% |
| west | 118 | 0.9% |
| beach | 93 | 0.7% |
| new | 92 | 0.7% |
| Other values (5863) | 11621 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8702 | 9.0% |
| o | 7365 | 7.6% |
| n | 7322 | 7.6% |
| a | 7251 | 7.5% |
| l | 6481 | 6.7% |
| i | 6062 | 6.3% |
| r | 5834 | 6.0% |
| t | 4747 | 4.9% |
| s | 4111 | 4.2% |
| 4023 | 4.1% | |
| Other values (56) | 35043 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 96941 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 8702 | 9.0% |
| o | 7365 | 7.6% |
| n | 7322 | 7.6% |
| a | 7251 | 7.5% |
| l | 6481 | 6.7% |
| i | 6062 | 6.3% |
| r | 5834 | 6.0% |
| t | 4747 | 4.9% |
| s | 4111 | 4.2% |
| 4023 | 4.1% | |
| Other values (56) | 35043 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 96941 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 8702 | 9.0% |
| o | 7365 | 7.6% |
| n | 7322 | 7.6% |
| a | 7251 | 7.5% |
| l | 6481 | 6.7% |
| i | 6062 | 6.3% |
| r | 5834 | 6.0% |
| t | 4747 | 4.9% |
| s | 4111 | 4.2% |
| 4023 | 4.1% | |
| Other values (56) | 35043 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 96941 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 8702 | 9.0% |
| o | 7365 | 7.6% |
| n | 7322 | 7.6% |
| a | 7251 | 7.5% |
| l | 6481 | 6.7% |
| i | 6062 | 6.3% |
| r | 5834 | 6.0% |
| t | 4747 | 4.9% |
| s | 4111 | 4.2% |
| 4023 | 4.1% | |
| Other values (56) | 35043 |
Population
Text
| Distinct | 7602 |
|---|---|
| Distinct (%) | 79.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 498.7 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.2980478 |
| Min length | 0 |
Characters and Unicode
| Total characters | 41171 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6118 ? |
|---|---|
| Unique (%) | 63.9% |
Sample
| 1st row | 2608 |
|---|---|
| 2nd row | 4377 |
| 3rd row | 738 |
| 4th row | 33040 |
| 5th row | 21525 |
| Value | Count | Frequency (%) |
| 1392 | 7 | 0.1% |
| 1619 | 6 | 0.1% |
| 1313 | 6 | 0.1% |
| 1401 | 6 | 0.1% |
| 1213 | 6 | 0.1% |
| 1004 | 6 | 0.1% |
| 1655 | 5 | 0.1% |
| 972 | 5 | 0.1% |
| 866 | 5 | 0.1% |
| 2108 | 5 | 0.1% |
| Other values (7591) | 9519 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6242 | |
| 2 | 4959 | |
| 3 | 4311 | |
| 4 | 4155 | |
| 5 | 3840 | |
| 6 | 3654 | |
| 7 | 3638 | |
| 8 | 3578 | |
| 9 | 3410 | |
| 0 | 3384 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 41171 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6242 | |
| 2 | 4959 | |
| 3 | 4311 | |
| 4 | 4155 | |
| 5 | 3840 | |
| 6 | 3654 | |
| 7 | 3638 | |
| 8 | 3578 | |
| 9 | 3410 | |
| 0 | 3384 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 41171 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6242 | |
| 2 | 4959 | |
| 3 | 4311 | |
| 4 | 4155 | |
| 5 | 3840 | |
| 6 | 3654 | |
| 7 | 3638 | |
| 8 | 3578 | |
| 9 | 3410 | |
| 0 | 3384 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 41171 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6242 | |
| 2 | 4959 | |
| 3 | 4311 | |
| 4 | 4155 | |
| 5 | 3840 | |
| 6 | 3654 | |
| 7 | 3638 | |
| 8 | 3578 | |
| 9 | 3410 | |
| 0 | 3384 |
Murder
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3154818 |
| Minimum | 0 |
|---|---|
| Maximum | 765 |
| Zeros | 7655 |
| Zeros (%) | 79.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 765 |
| Range | 765 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 13.081593 |
|---|---|
| Coefficient of variation (CV) | 9.9443358 |
| Kurtosis | 1425.0552 |
| Mean | 1.3154818 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 31.396126 |
| Sum | 12601 |
| Variance | 171.12807 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7655 | |
| 1 | 977 | 10.2% |
| 2 | 332 | 3.5% |
| 3 | 155 | 1.6% |
| 4 | 95 | 1.0% |
| 5 | 67 | 0.7% |
| 6 | 37 | 0.4% |
| 7 | 33 | 0.3% |
| 8 | 27 | 0.3% |
| 10 | 25 | 0.3% |
| Other values (73) | 176 | 1.8% |
| Value | Count | Frequency (%) |
| 0 | 7655 | |
| 1 | 977 | 10.2% |
| 2 | 332 | 3.5% |
| 3 | 155 | 1.6% |
| 4 | 95 | 1.0% |
| 5 | 67 | 0.7% |
| 6 | 37 | 0.4% |
| 7 | 33 | 0.3% |
| 8 | 27 | 0.3% |
| 9 | 21 | 0.2% |
| Value | Count | Frequency (%) |
| 765 | 1 | |
| 335 | 1 | |
| 318 | 1 | |
| 303 | 1 | |
| 301 | 1 | |
| 293 | 1 | |
| 273 | 1 | |
| 196 | 1 | |
| 188 | 1 | |
| 174 | 1 |
Rape
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 190 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.7759683 |
| Minimum | 0 |
|---|---|
| Maximum | 2372 |
| Zeros | 4306 |
| Zeros (%) | 45.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 29 |
| Maximum | 2372 |
| Range | 2372 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 57.125403 |
|---|---|
| Coefficient of variation (CV) | 6.5092992 |
| Kurtosis | 775.25693 |
| Mean | 8.7759683 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 23.887698 |
| Sum | 84065 |
| Variance | 3263.3117 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4306 | |
| 1 | 1345 | 14.0% |
| 2 | 718 | 7.5% |
| 3 | 526 | 5.5% |
| 4 | 342 | 3.6% |
| 5 | 292 | 3.0% |
| 6 | 234 | 2.4% |
| 7 | 166 | 1.7% |
| 8 | 140 | 1.5% |
| 9 | 136 | 1.4% |
| Other values (180) | 1374 | 14.3% |
| Value | Count | Frequency (%) |
| 0 | 4306 | |
| 1 | 1345 | 14.0% |
| 2 | 718 | 7.5% |
| 3 | 526 | 5.5% |
| 4 | 342 | 3.6% |
| 5 | 292 | 3.0% |
| 6 | 234 | 2.4% |
| 7 | 166 | 1.7% |
| 8 | 140 | 1.5% |
| 9 | 136 | 1.4% |
| Value | Count | Frequency (%) |
| 2372 | 1 | |
| 2343 | 1 | |
| 1589 | 1 | |
| 1259 | 1 | |
| 1210 | 1 | |
| 1200 | 1 | |
| 1190 | 1 | |
| 1019 | 1 | |
| 867 | 1 | |
| 767 | 1 |
Robbery
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 347 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.139367 |
| Minimum | 0 |
|---|---|
| Maximum | 15544 |
| Zeros | 4027 |
| Zeros (%) | 42.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 6 |
| 95-th percentile | 71 |
| Maximum | 15544 |
| Range | 15544 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 304.80359 |
|---|---|
| Coefficient of variation (CV) | 10.460199 |
| Kurtosis | 1249.6535 |
| Mean | 29.139367 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 31.408348 |
| Sum | 279126 |
| Variance | 92905.228 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4027 | |
| 1 | 1315 | 13.7% |
| 2 | 671 | 7.0% |
| 3 | 465 | 4.9% |
| 4 | 306 | 3.2% |
| 5 | 257 | 2.7% |
| 6 | 201 | 2.1% |
| 7 | 184 | 1.9% |
| 8 | 137 | 1.4% |
| 9 | 112 | 1.2% |
| Other values (337) | 1904 |
| Value | Count | Frequency (%) |
| 0 | 4027 | |
| 1 | 1315 | 13.7% |
| 2 | 671 | 7.0% |
| 3 | 465 | 4.9% |
| 4 | 306 | 3.2% |
| 5 | 257 | 2.7% |
| 6 | 201 | 2.1% |
| 7 | 184 | 1.9% |
| 8 | 137 | 1.4% |
| 9 | 112 | 1.2% |
| Value | Count | Frequency (%) |
| 15544 | 1 | |
| 11957 | 1 | |
| 10307 | 1 | |
| 9962 | 1 | |
| 6199 | 1 | |
| 5236 | 1 | |
| 4974 | 1 | |
| 4604 | 1 | |
| 3976 | 1 | |
| 3283 | 1 |
Assault
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 496 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.900407 |
| Minimum | 0 |
|---|---|
| Maximum | 30873 |
| Zeros | 1554 |
| Zeros (%) | 16.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 6 |
| Q3 | 22.5 |
| 95-th percentile | 167 |
| Maximum | 30873 |
| Range | 30873 |
| Interquartile range (IQR) | 21.5 |
Descriptive statistics
| Standard deviation | 511.97978 |
|---|---|
| Coefficient of variation (CV) | 8.4068368 |
| Kurtosis | 1640.4193 |
| Mean | 60.900407 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 34.014611 |
| Sum | 583365 |
| Variance | 262123.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1554 | 16.2% |
| 1 | 903 | 9.4% |
| 2 | 702 | 7.3% |
| 3 | 567 | 5.9% |
| 4 | 474 | 4.9% |
| 5 | 361 | 3.8% |
| 6 | 321 | 3.4% |
| 7 | 289 | 3.0% |
| 8 | 234 | 2.4% |
| 9 | 211 | 2.2% |
| Other values (486) | 3963 |
| Value | Count | Frequency (%) |
| 0 | 1554 | |
| 1 | 903 | |
| 2 | 702 | |
| 3 | 567 | 5.9% |
| 4 | 474 | 4.9% |
| 5 | 361 | 3.8% |
| 6 | 321 | 3.4% |
| 7 | 289 | 3.0% |
| 8 | 234 | 2.4% |
| 9 | 211 | 2.2% |
| Value | Count | Frequency (%) |
| 30873 | 1 | |
| 15874 | 1 | |
| 15815 | 1 | |
| 12487 | 1 | |
| 9882 | 1 | |
| 8029 | 1 | |
| 7803 | 1 | |
| 7188 | 1 | |
| 7183 | 1 | |
| 7099 | 1 |
Burglary
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 716 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 107.80635 |
| Minimum | 0 |
|---|---|
| Maximum | 18488 |
| Zeros | 686 |
| Zeros (%) | 7.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 16 |
| Q3 | 57 |
| 95-th percentile | 360.2 |
| Maximum | 18488 |
| Range | 18488 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 564.72441 |
|---|---|
| Coefficient of variation (CV) | 5.2383225 |
| Kurtosis | 381.5097 |
| Mean | 107.80635 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 17.09362 |
| Sum | 1032677 |
| Variance | 318913.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 686 | 7.2% |
| 1 | 483 | 5.0% |
| 2 | 401 | 4.2% |
| 3 | 390 | 4.1% |
| 4 | 339 | 3.5% |
| 5 | 334 | 3.5% |
| 6 | 289 | 3.0% |
| 8 | 265 | 2.8% |
| 7 | 254 | 2.7% |
| 9 | 231 | 2.4% |
| Other values (706) | 5907 |
| Value | Count | Frequency (%) |
| 0 | 686 | |
| 1 | 483 | |
| 2 | 401 | |
| 3 | 390 | |
| 4 | 339 | |
| 5 | 334 | |
| 6 | 289 | |
| 7 | 254 | 2.7% |
| 8 | 265 | 2.8% |
| 9 | 231 | 2.4% |
| Value | Count | Frequency (%) |
| 18488 | 1 | |
| 15821 | 1 | |
| 14258 | 1 | |
| 13024 | 1 | |
| 12500 | 1 | |
| 12235 | 1 | |
| 12041 | 1 | |
| 10948 | 1 | |
| 10209 | 1 | |
| 9150 | 1 |
Theft
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 1529 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 441.41988 |
| Minimum | 0 |
|---|---|
| Maximum | 106868 |
| Zeros | 345 |
| Zeros (%) | 3.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 19 |
| median | 73 |
| Q3 | 279 |
| 95-th percentile | 1488.1 |
| Maximum | 106868 |
| Range | 106868 |
| Interquartile range (IQR) | 260 |
Descriptive statistics
| Standard deviation | 2310.3695 |
|---|---|
| Coefficient of variation (CV) | 5.2339499 |
| Kurtosis | 749.72279 |
| Mean | 441.41988 |
| Median Absolute Deviation (MAD) | 66 |
| Skewness | 22.745682 |
| Sum | 4228361 |
| Variance | 5337807.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 345 | 3.6% |
| 2 | 139 | 1.5% |
| 1 | 137 | 1.4% |
| 4 | 133 | 1.4% |
| 5 | 127 | 1.3% |
| 7 | 123 | 1.3% |
| 3 | 121 | 1.3% |
| 6 | 119 | 1.2% |
| 12 | 114 | 1.2% |
| 11 | 113 | 1.2% |
| Other values (1519) | 8108 |
| Value | Count | Frequency (%) |
| 0 | 345 | |
| 1 | 137 | 1.4% |
| 2 | 139 | |
| 3 | 121 | 1.3% |
| 4 | 133 | 1.4% |
| 5 | 127 | 1.3% |
| 6 | 119 | 1.2% |
| 7 | 123 | 1.3% |
| 8 | 106 | 1.1% |
| 9 | 100 | 1.0% |
| Value | Count | Frequency (%) |
| 106868 | 1 | |
| 69630 | 1 | |
| 64739 | 1 | |
| 61229 | 1 | |
| 58318 | 1 | |
| 37568 | 1 | |
| 36991 | 1 | |
| 36856 | 1 | |
| 27280 | 1 | |
| 26639 | 1 |
Vehicle_Theft
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 523 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 61.32018 |
| Minimum | 0 |
|---|---|
| Maximum | 18591 |
| Zeros | 1931 |
| Zeros (%) | 20.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 75.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 4 |
| Q3 | 18 |
| 95-th percentile | 174.1 |
| Maximum | 18591 |
| Range | 18591 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 436.48317 |
|---|---|
| Coefficient of variation (CV) | 7.1181 |
| Kurtosis | 581.47736 |
| Mean | 61.32018 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 20.439846 |
| Sum | 587386 |
| Variance | 190517.56 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1931 | |
| 1 | 1109 | 11.6% |
| 2 | 747 | 7.8% |
| 3 | 610 | 6.4% |
| 4 | 475 | 5.0% |
| 5 | 391 | 4.1% |
| 6 | 327 | 3.4% |
| 7 | 244 | 2.5% |
| 8 | 220 | 2.3% |
| 9 | 188 | 2.0% |
| Other values (513) | 3337 |
| Value | Count | Frequency (%) |
| 0 | 1931 | |
| 1 | 1109 | |
| 2 | 747 | 7.8% |
| 3 | 610 | 6.4% |
| 4 | 475 | 5.0% |
| 5 | 391 | 4.1% |
| 6 | 327 | 3.4% |
| 7 | 244 | 2.5% |
| 8 | 220 | 2.3% |
| 9 | 188 | 2.0% |
| Value | Count | Frequency (%) |
| 18591 | 1 | |
| 12738 | 1 | |
| 11473 | 1 | |
| 8905 | 1 | |
| 8855 | 1 | |
| 7960 | 1 | |
| 7710 | 1 | |
| 7703 | 1 | |
| 7592 | 1 | |
| 7233 | 1 |
| Assault | Burglary | Murder | Rape | Region | Robbery | Theft | Vehicle_Theft | |
|---|---|---|---|---|---|---|---|---|
| Assault | 1.000 | 0.827 | 0.526 | 0.659 | 0.004 | 0.759 | 0.814 | 0.786 |
| Burglary | 0.827 | 1.000 | 0.538 | 0.662 | 0.036 | 0.813 | 0.904 | 0.859 |
| Murder | 0.526 | 0.538 | 1.000 | 0.409 | 0.000 | 0.547 | 0.519 | 0.523 |
| Rape | 0.659 | 0.662 | 0.409 | 1.000 | 0.027 | 0.603 | 0.676 | 0.642 |
| Region | 0.004 | 0.036 | 0.000 | 0.027 | 1.000 | 0.004 | 0.028 | 0.038 |
| Robbery | 0.759 | 0.813 | 0.547 | 0.603 | 0.004 | 1.000 | 0.824 | 0.792 |
| Theft | 0.814 | 0.904 | 0.519 | 0.676 | 0.028 | 0.824 | 1.000 | 0.863 |
| Vehicle_Theft | 0.786 | 0.859 | 0.523 | 0.642 | 0.038 | 0.792 | 0.863 | 1.000 |
| Region | State | City | Population | Murder | Rape | Robbery | Assault | Burglary | Theft | Vehicle_Theft | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | South | ALABAMA | Abbeville | 2608 | 0 | 1 | 0 | 10 | 12 | 34 | 5 |
| 1 | South | ALABAMA | Adamsville | 4377 | 0 | 0 | 10 | 9 | 33 | 201 | 16 |
| 2 | South | ALABAMA | Addison | 738 | 0 | 0 | 0 | 1 | 1 | 11 | 2 |
| 3 | South | ALABAMA | Alabaster | 33040 | 1 | 2 | 2 | 92 | 58 | 411 | 19 |
| 4 | South | ALABAMA | Albertville | 21525 | 0 | 5 | 10 | 14 | 190 | 462 | 69 |
| 5 | South | ALABAMA | Alexander City | 14695 | 2 | 4 | 14 | 257 | 123 | 493 | 41 |
| 6 | South | ALABAMA | Aliceville | 2362 | 0 | 0 | 0 | 5 | 10 | 14 | 0 |
| 7 | South | ALABAMA | Andalusia | 9071 | 1 | 6 | 6 | 71 | 64 | 397 | 16 |
| 8 | South | ALABAMA | Anniston | 22205 | 7 | 39 | 87 | 602 | 712 | 888 | 112 |
| 9 | South | ALABAMA | Arab | 8334 | 1 | 11 | 3 | 55 | 113 | 366 | 32 |
| Region | State | City | Population | Murder | Rape | Robbery | Assault | Burglary | Theft | Vehicle_Theft | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 9569 | West | WYOMING | Rawlins | 9004 | 0 | 6 | 1 | 10 | 34 | 228 | 18 |
| 9570 | West | WYOMING | Riverton | 10905 | 1 | 8 | 4 | 34 | 52 | 521 | 37 |
| 9571 | West | WYOMING | Rock Springs | 24161 | 2 | 35 | 2 | 40 | 58 | 406 | 27 |
| 9572 | West | WYOMING | Saratoga | 1675 | 0 | 0 | 0 | 0 | 6 | 11 | 2 |
| 9573 | West | WYOMING | Sheridan | 17956 | 2 | 0 | 1 | 14 | 54 | 295 | 13 |
| 9574 | West | WYOMING | Sundance | 1289 | 0 | 1 | 0 | 0 | 0 | 14 | 0 |
| 9575 | West | WYOMING | Thermopolis | 2967 | 1 | 0 | 3 | 3 | 3 | 28 | 1 |
| 9576 | West | WYOMING | Torrington | 6676 | 0 | 0 | 0 | 25 | 19 | 77 | 3 |
| 9577 | West | WYOMING | Wheatland | 3665 | 0 | 0 | 0 | 8 | 13 | 47 | 3 |
| 9578 | West | WYOMING | Worland | 5348 | 0 | 3 | 0 | 3 | 18 | 43 | 1 |